161 Discovering unexpected information from your competitors' web sites 82%
Bing Liu , Yiming Ma , Philip S. Yu

Proceedings of the seventh ACM SIGKDD international conference on Knowledge
discovery and data mining August 2001

Ever since the beginning of the Web, finding useful information from the Web has been an
important problem. Existing approaches include keyword-based search, wrapper-based
information extraction, Web query and user preferences. These approaches essentially find
information that matches the user's explicit specifications. This paper argues that this is
insufficient. There is another type of information that is also of great interest, i.e., unexpected
information, which is unanticipated by the use ...

162 Constraints for semistructured data and XML 82%
Peter Buneman , Wenfei Fan , Jérôme Siméon , Scott Weinstein

ACM SIGMOD Record March 2001 
Volume 30 Issue 1 

Integrity constraints play a fundamental role in database design. We review initial work on 
the expression of integrity constraints for semistructured data and XML. 

163 Function-based object model towards website adaptation 82% 
Jinlin Chen , Baoyao Zhou , Jin Shi , Hongjiang Zhang , Qiu Fengwu 

Proceedings of the tenth international conference on World Wide Web April 2001 

164 Towards second and third generation web-based multimedia 82% 
Jacco van Ossenbruggen , Joost Geurts , Frank Cornelissen , Lynda Hardman , Lloyd Rutledge

165 The design and implementation of the Redland RDF application framework 82%
David Beckett

Proceedings of the tenth international conference on World Wide Web April 2001 

166 Tools for application-oriented performance tuning 82% 
John Mellor-Crummey , Robert Fowler , David Whalley

Proceedings of the 15th international conference on Supercomputing June 2001 

Application performance tuning is a complex process that requires assembling various types 
of information and correlating it with source code to pinpoint the causes of performance 
bottlenecks. Existing performance tools don't adequately support this process in one or more 
dimensions. We discuss some of the critical utility and usability issues for application-level 
performance analysis tools in the context of two performance tools, MHSim and HPCView, 
that we built to support our ... 



167 Tools for World Wide Web based legal decision support systems 82% 
Andrew Stranieri , John Yearwood , John Zeleznikow

Proceedings of the 8th international conference on Artificial intelligence and law May 2001



The majority of legal knowledge based systems (LKBS) in commercial use are rule based and 
target domains of law characterized by large and complex statutes where modelling discretion 
is not a central concern. Furthermore, to date, few LKBS execute on the World Wide Web. 
Despite this, LKBS designed for a web environment can make law more universally 
accessible and transparent. Tools required to facilitate the development of web based systems 
include a web based expert system shell, conceptual ... 



168 Regular expression pattern matching for XML 

Haruo Hosoya , Benjamin Pierce

ACM SIGPLAN Notices , Proceedings of the 28th ACM SIGPLAN-SIGACT 
symposium on Principles of programming languages January 2001 
Volume 36 Issue 3 

We propose regular expression pattern matching as a core feature for programming 
languages for manipulating XML (and similar tree-structured data formats). We extend 
conventional pattern-matching facilities with regular expression operators such as repetition 
(*), alternation (|), etc., that can match arbitrarily long sequences of subtrees, allowing a
compact pattern to extract data from the middle of a complex sequence. We show how to 
check standard notions of exhaustiveness and r ... 



169 Prototype for wrapping and visualizing geo-referenced data in a distributed environment 82%
using XML technology

Jianting Zhang , Muhammad Javed , Amir Shaheen , Le Gruenwald 

Proceedings of the eighth ACM international symposium on Advances in geographic
information systems November 2000

This paper proposes a prototype for integration and visualization of geo-referenced 
information (GRI) in a distributed environment in general and World Wide Web in particular. 
This prototype adopts a three-tier architecture and includes three main components: GRI 
wrapper for distributed GRI web sites, GRI integration mediator and client side visualization 
In this prototype, XML is used as a communication protocol between distributed web sites
that provide GRI and the mediat ... 



170 Requirements engineering for product families 82% 
Juha Kuusela , Juha Savolainen 

Proceedings of the 22nd international conference on Software engineering June 2000 
In search for improved software quality and high productivity, software reuse has become a 
key research area. One of the most promising reuse approaches is product families. However, 
current practices in requirements engineering do not support product families. This paper 
describes a definition hierarchy method for requirements capturing, structuring, analysis and 
documentation. This method helps to identify architectural drivers of the product family and 
shows how different products in the ... 



171 Multivalent documents 82% 
Thomas A. Phelps , Robert Wilensky

Communications of the ACM June 2000 

Volume 43 Issue 6 



172 On multi-resolution document transmission in mobile Web 82%
Stanley M. T. Yau , Hong Va Leong , Dennis McLeod , Antonio Si

ACM SIGMOD Record September 1999 

Volume 28 Issue 3 

We propose a multi-resolution transmission mechanism that allows various organizational 
units of a web document to be transferred and browsed according to the amount of 
information captured. We define the notion of information content for each individual 
organizational unit of a web document as an indication of its captured information. The 
concept of information content is used as a foundation for defining the notion of relative 
informatio ... 



173 Technical papers: component technologies: Component rank: relative significance rank for 82% 
software component search 

Katsuro Inoue , Reishi Yokomori , Hikaru Fujiwara , Tetsuo Yamamoto , Makoto Matsushita ,
Shinji Kusumoto

Proceedings of the 25th international conference on Software engineering May 2003 
Collections of already developed programs are important resources for efficient development 
of reliable software systems. In this paper, we propose a novel method of ranking software 
components, called Component Rank, based on analyzing actual use relations among the 
components and propagating the significance through the use relations. We have developed a 
component-rank computation system, and applied it to various Java programs. The result is 
promising such that non-specific and generic ... 



174 Research track: A bag of paths model for measuring structural similarity in Web documents 82% 
Sachindra Joshi , Neeraj Agrawal , Raghu Krishnapuram , Sumit Negi 
Proceedings of the ninth ACM SIGKDD international conference on Knowledge 
discovery and data mining August 2003 

Structural information (such as layout and look-and-feel) has been extensively used in the 
literature for extraction of interesting or relevant data, efficient storage, and query
optimization. Traditionally, tree models (such as DOM trees) have been used to represent 
structural information, especially in the case of HTML and XML documents. However, 
computationally expensive. In this paper, we propose an alternative ...



175 Short papers: Visualization of ontologies through hypertrees 82% 
Kleber X. S. de Souza , Adriana D. dos Santos , Silvio R. M. Evangelista 
Proceedings of the Latin American conference on Human-computer interaction August 
2003 

In this paper, we present the use of hypertree as a supporting tool for visualization of 
ontologies in agricultural domain. This kind of visualization technique was used in the 
Information Agency Project, in execution by the Brazilian Agricultural Research Corporation 
— Embrapa. The project's aim is to provide an information dissemination system structured in 
accordance to the productive chains of given products. That structure was chosen because it 
reflects the natural way technicians use to i ... 



176 Adaptive hypermedia (2): "Pluggable" user models for adaptive hypermedia in education 82% 
M. R. Zakaria , A. Moore , C. D. Stewart , T. J. Brailsford 

Proceedings of the fourteenth ACM conference on Hypertext and hypermedia August 
2003 

Most adaptive hypermedia systems used in education implement a single user model - 
inevitably originally designed for a specific set of circumstances. In this paper we describe an 
architecture that makes use of XML pipelines to facilitate the implementation of different 
user models. 



177 Technical papers: Learning programs from traces using version space algebra 82% 
Tessa Lau , Pedro Domingos , Daniel S. Weld 

Proceedings of the international conference on Knowledge capture October 2003 
While existing learning techniques can be viewed as inducing programs from examples, most 
research has focused on rather narrow classes of programs, e.g., decision trees or logic rules. 
In contrast, most of today's programs are written in languages such as C++ or Java. Thus, 
many tasks we wish to automate (e.g. programming by demonstration and software reverse 
engineering) might be best formulated as induction of code in a procedural language. In this 
paper we apply version space algebra [10] to ... 



178 Designing and accessing scientific digital libraries: On querying geospatial and georeferenced 82% 
metadata resources in G-portal 

Zehua Liu , Ee-Peng Lim , Wee-Keong Ng , Dion H. Goh 

Proceedings of the third ACM/IEEE-CS joint conference on Digital libraries May 2003 
G-Portal is a web portal system providing a range of digital library services to access 
geospatial and georeferenced resources on the Web. Among them are the storage and query 
subsystems that provide a central repository of metadata resources organized under different 
projects. In G-Portal, all metadata resources are represented in XML (Extensible Markup
Language) and they are compliant to some resource schemas defined by their creators. The 
resource schemas are extended versions of a basic resou ... 



179 Usage-based visualization of web localities 82% 
Boris Diebold , Michael Kaufmann 

Australian symposium on Information visualisation - Volume 9 December 2001 
The World-Wide Web has evolved into an extremely huge but "messy" information space 
which is hard to overview. Sitemaps as alternative views of Web sites have been proposed to 
assist the user in navigating the hyperspace. As Web localities are subject to frequent change 
and redesign, it is especially important to provide a system for automatic generation of such 

180 Transformations and Experiences: Towards static type checking for XSLT 82%
Akihiko Tozawa 

Proceedings of the 2001 ACM Symposium on Document engineering November 2001 
We are concerned about the static type checking problem for XSLT. In the context of XSLT 
and other XML programming, types are DTDs or schemas, and static type checking is to 
verify that a program always converts valid source documents into also valid output 
documents. To achieve static type checking for XSLT, we introduce a subset of XSLT, and an 
efficient algorithm of backward type inference for that subset. Although our XSLT subset 
lacks XPath, it includes recursiv ... 

A preliminary study on strategic bidding in electri
markets with step-wise bidding protocol 

Li Ma; Wen Fushuan; David, A.K.
Zhejiang Univ., Hangzhou, China 

This paper appears in: Transmission and Distribution Conference and Ex 
2002: Asia Pacific. IEEE/PES 

Publication Date: 6-10 Oct. 2002 
On page(s): 1960 - 1965 vol.3 
Volume: 3 
ISSN: 

Number of Pages: 3 vol. 2377 
Inspec Accession Number: 7644267 

Abstract: 

The power industry of China is now being restructured and generation market 
expected to be established nationwide in 10-15 years. Zhejiang provincial ele 
market, as a pilot one, has been successfully operated for more than two yea 
electricity market environment, the profits of generation companies depend, t 
extent, on their bidding strategies. As a result, how to develop the optimal b 
strategy has become a major concern of generation companies. Given this ba 
model of bidding strategies based on Zhejiang provincial electricity market in 
step-wise bidding rules are utilized is developed in this paper. Rival bidding 
are described by a normal distribution function, and the problem of building t 
bidding strategy for a generation company is then formulated as a stochastic 
optimization problem, and solved by a Monte Carlo approach. A simple numer 
example with five suppliers is served for illustrating the essential features oft 
presented method. 